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Audio coding 



The invention relates to coding at least part of an audio signal. 



10 



15 



In the art of audio coding, Linear Predictive Coding (LPC) is well known for 
representing spectral content. Further, many efficient quantization schemes have been 
proposed for such linear predictive systems, e.g. Log Area Ratios [1], Reflection Coefficients 
[2] and Line Spectral Representations such as Line Spectral Pairs or Line Spectral 
Frequencies [3, 4, 5]. 

Without going into much detail on how the filter-coefficients are transformed 
to a line Spectral Representation (reference is made to [6, 7, 8> 9, 10] for more detail), the 
results are that an M-th order alt-pole LPC filter H(z) is transformed to M frequencies, often 
referred to as Line Spectral Frequencies (L3F). These frequencies uniquely represent the 
filter H(z). As an example see Fig. 1, Note that for clarity the Line Spectral Frequencies have 
been depicted in Fig, 1 as lines towards the amplitude response of the filter, although they are 
nothing more than just frequencies, and thus do not in themselves contain any amplitude 
information whatsoever. 



An object of the invention is to provide advantageous coding of at least part of 
an audio signal. To this end, the invention provides a method of encoding, an encoder, an 
encoded audio signal, a storage medium, a method of decoding, a decoder, a transmitter, a 

20 receiver and a system as defined in the independent claims. Advantageous embodiments are 
.defined in the dependent claims. 

According to a first aspect of the invention, at least part of an audio signal is 
coded in order to obtain an encoded signal, the coding comprising predictive coding the at 
least part of the audio signal in order to obtain prediction coefficients which represent 

25 temporal properties, such as a temporal envelope, of the at least part of the audio signal, 
transforming the prediction coefficients into a set of times representing the prediction 
coefficients, and including the set of times in the encoded signal. Note that times without any 
amplitude information suffice to represent the prediction coefficients. 
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Although a temporal shape of a signal or a component thereof can also be 
directly encoded in the form of a set of amplitude or gain values, it has been the inventor's 
insight that higher quality can be obtained by using predictive coding to obtain prediction 
coefficient? which represent temporal properties such as a temporal envelope and 
transforming these prediction coefficients to into a set of times. Higher quality can be 
obtained because locally (where needed) higher time resolution can be obtained compared to 
fixed time-axis technique. The predictive coding may be implemented by using the amplitude 
response of an LPC filter to represent the temporal envelope. 

It has been a further insight of the inventors that especially the use of a time 
domain derivative or equivalent of the Line Spectral Representation is advantageous in 
coding such prediction coefficients representing temporal envelopes, because with this 
technique times or time instants are well defined which makes them more suitable for further 
encoding, Therefore* with this aspect of the invention, an efficient coding of temporal 
properties of at least part of an audio signal is obtained, attributing to a better compression of 
the at least part of an audio signal. 

Embodiments of the invention can be interpreted as using an LPC spectrum to 
describe a temporal envelope instead of a spectral envelope and that what is time in the case 
of a spectral envelope, now is frequency and vice versa, as shown in the bottom part of Pig. 
2. This means that using a Line Spectral Representation now results in a set of times or time 
instances instead of frequencies. Note that in this approach times are not fixed at 
predetermined intervals on tihe time-axis, but that the times themselves represent the 
prediction coefficients. * ■ - 

The inventors realized that when using overlapping frame analysis/synthesis 
for the temporal envelope, redundancy in the Line Spectral Representation at the overlap can 
be exploited. Embodiments of the invention exploit this redundancy in an advantageous 
manner. 

The invention and embodiments thereof are in particular advantageous for the 
coding of a temporal envelope of a noise component in the audio signal in a parametric audio 
coding schemes such as disclosed in WO 01/69593-A1. In such a parametric audio coding 
scheme* an audio signal may be dissected into transient signal components* sinusoidal signal 
components and noise components. The parameters representing the sinusoidal components 
may be amplitude, frequency and phase. For the transient components the extension of such 
parameters with an envelope description is an efficient representation, 
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Note that the invention and embodiments thereof can be applied to the entire 
relevant frequency band of the audio signal or a component thereof; but also to a smaller 
frequency band. 

These and other aspects of the invention will be apparent from the elucidated 
5 wilkrefeenc#to1heaccora^ 

In the drawings: 

Fig. 1 shows an example of an LPC spectrum with 8 poles with corresponding 
8 Line Spectral Frequencies according to prior art; 
10 Fig. 2 shows (top) using LPC such that H(z) represents a frequency spectrum, 

(bottom) using LPC such that H(z) represents a temporal envelope; 

Fig. 3 shows a stylized view of exemplary analysis/synthesis windowing; 

Fig. 4 shows an example sequence of LSF times for two subsequent frames; 

Fig, 5 shows matching of LSF times by shifting LSF times in a frame k 
1 5 relative to a previous frame k-1 ; 

Fig. 6 shows weighting functions as Amotion of overlap; and 

Fig. 7 shows a system according to an embodiment of the invention. 

The drawings only show those elements that axe necessary to understand the 
embodiments of the invention, 

20 

Although the below description is directed to the use of an LPC filter and the 
calculation of time domain derivatives ox equivalents of LSFs, the invention is also 
applicable to other filters and representations which fall within the scope of the claims. 

Fig, 2 shows how a predictive filter such as an LPC filter can be used to 

25 describe a temporal envelope of an audio signal or a component thereof* In order to be able to 
use a conventional LPC filter, the input signal is first transformed from time domain to 
frequency domain by e.g. a Fourier Transform. So in fact, the temporal shape is transformed 
in a spectral shape which is coded by a subsequent conventional LPC filter which is normally 
used to code a spectral shape. The LPC filter analysis provides prediction coefficients which 

3 0 represent the temporal shape of the input signal. There is a trade-off between time-resolution 
and frequency resolution. Say that e.g. the LPC spectrum would consist of a number of very 
sharp peaks (sinusoids). Then the auditory system is less sensitive to time-resohitlon changes, 
thus less resolution is needed, also the other way around, e.g. within a transient the resolution 
of the frequency spectrum does not need to be accurate. In this sense one could see this as a 
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combined coding, Hie resolution of the time-domain is dependent on the resolution of the 
frequency domain and vice versa. One could also employ multiple LPC curves for the time- 
domain estimation, e.g. a low and a high frequenoy band, also here the resolution could be 
dependent on the resolution of the frequency estimation etc, this could thus be exploited. 
5 An LPC filter H(z) can generally be described ast 

H(z\ = - b \ - 

A{z) i 4- ofT 1 + a^z' 2 -k.. -b a m z" m 

The coefficients a^vdih i running from 1 to m, are the prediction filter coefficients resulting 

from the LPC analysis. The coefficients a* determine H(z), 

To calculate the time domain equivalents of the LSFs, the following procedure 
10 can be used. Most of this procedure is valid for a general ail-pole filter H(:z), so also for 

frequency domain. Other procedures known for deriving LSFs in the frequency domain can 

also be used to calculate the time domain equivalents of the LSFs. 

The polynomial A(z) is split into two polynomials P(z) and Q(z) of order . 

The polynomial P(z) is formed by adding a reflection coefficient (in lattice filter foim) of +i 
15 toA(z) 9 Q(z) is formed by adding a reflection coefficient of -1. There's a recurrent relation 

between the LPC filter in the direct form (equation above) and the lattice form: 

400 « + k^r 1 A i ^(z ml ) 

with 1^1,2, „.>m, Ao(z)—l and k\ the reflection coefficient. 
20 The polynomials P(z) and Q(z) are obtained byi 
Q{z)^A m {z)-z<^A m ^) 

The polynomials P<^ = i^^^^ ~ l+gis^+qj***. 

z -fi»+0 obtained in this way are even symmetrical and anti-symmetrical: 

25 Pl^Pm qj^-tJm 



30 Some important properties of these polynomials; 

- All zeros of P(z) and Q(z) are on the unit circle in the z-plane, 
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_ The zeros of P(z) and Q(z) are interlaced on the unit circle and do not overlap. 
- Minimum phase property of A(z) is preserved after quantization guaranteeing stability of 

Both polynoxnialp P(z) and Q(z) have m+1 zeros. It can be easily seen that z--l and z=l are 
5 always a zero in P(z) or Q(z). Therefore they can be removed by dividing by 1+z" 1 and 1-z' 1 . 
If m is even this leads to; 



gCg) 



If m is odd: 
10 n 



The zeros of the polynomials P '(z) and Q '(^ are bow described by s,W because the LPC 
filter is applied in the temporal domain- The 2eros of the polynomials P *(z) and Q '(z) are thus 
fully characterized by their time t 9 which runs from 0 to n over a frame, wherein 0 
15 corresponds to a start of the frame and to an end of that frame, which frame can actually 
have any practical length, e.g. 10 or 20 ms. The times / resulting from this derivation can be 
interpreted as time domain equivalents of the line spectral frequencies, which times are 
ftjrtkej. called LSF times herein. To calculate the actual LSF times* the roots of P '(si) and 
Q '( z ) have to be calculated. The different techniques that have been proposed in 0>],|!1O].[1 1] 

20 can also be used in the present context 

Fig. 3 shows a stylized view of an exemplary situation for analysis and 
synthesis of temporal envelopes. At each frame k a, not necessarily rectangular, window is 
used to analyze the segment by LPC. So for each frame, after conversion, a set of iVLSF 
times is obtained. Note that H in principal does not need to be constant, although in many 

25 cases this leads to a more efficient representation. In this embodiment we assume that the 
LSF times are uniformly quantized, although other techniques like vector quantization could 

also be applied here. 

Experiments have shown that in an overlap area as shown in Fig- 3 there is 
often redimdanay between the LSF times of frame h-I with those of frame jfc. Reference is 
30 also made to Figs, 4 and S. In embodiments of the invention which are described below, this 
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redundaney is exploited to more efficiently encode the LSF times, ^hich helps to better 
compress the at least part of an audio signal. Note that Figs. 4 and 5 show usual cases 
wherein the LSF times of frame A: in the overlapping area are not identical but however rather 
cloge to the LSF times in frame k-L 

5 

First embodiment using overlapping flames 

In a first embodiment using overlapping frames it is assumed that the 
differences between LSF times of overlapping areas can be s perceptually., neglected or result 
in an acceptable loss in quality. For a pair of LSF times, one in the frame krl and one in the 

1 0 frame k, a derived LSF time is derived which is a weighted average of the LSF times in the 
pair. A weighted average in this application is to be construed as including the case where 
only one out of the pair of LSF times is selected. Such a selection can be interpreted as a 
weighted average wherein the weight of the selected LSF time is one and the weight of the 
non-gelected time is sero. It is also possible that both LSF times of the pair have the same 

15 weight 

For example, assume LSF times { h h, h In } for frame krl and { !& l u l 2 , 
1 1 > j lu } for frame k as shown in Fig. 4- The LSF times in frame k are shifted such that a 
certain quantization level / is in the same position in each of the two frames. Now assume 
that there are three LSF times in the overlapping area for each frame, as is the case for Fig. 4 
20 and Fig. 5. Then the following corresponding pairs can be formed: {Ahau fy* , h t k , 1^ 
1 lu this embodiment, a new set of three derived LSF times is constructed based on the 
two original sets of three LSF times. A practical approach is to just take the LSF times of 
frame M (or k)> and calculate the LSF times of frame k (or k~l) by simply shifting the LSF 
times of frame h-1 (or fc) to align the frames in time. This shifting is performed in both the 
encoder and the decoder. In the encoder the LSFs of the right frame k are shifted to match the 
ones in the left frame k-L This is necessary to look for pairs and eventually determine the 
weighted average. 

In preferred embodiments, the derived time or weighted average is encoded 
into the bit-stream as a 'representation level' which is an integer value e.g. from 0 until 255 
30 (8 bits) representing 0 until pi. In practical embodiments also Huffman coding is applied. For 
a fir$t frame the first LSF time is coded absolutely (no reference point), all subsequent LSF 
times (including the weighted ones at the end) are coded differentially to their predecessor. 
Now, say frame k could make use of the 'trick' using the last 3 LSF times of frame k-L For 
decoding, frame k then takes the last three representation levels of frame krl (which are at the 



25 
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end of the region 0 until 255) and shift them back to its own time-axis (at the be ginning of 
the region 0 until 255)- All subsequent LSF times in frame k would be encoded differentially 
to their predecessor starting with the representation leval (on the axis of frame k) 
corresponding to the last LSF in the overlap area. In case frame k could not make use of the 
5 'trick 5 the first LSF time of frame k would be coded absolutely and all subsequent LSF times 
of frame k differential to their predecessor. 

A practical approach is to take averages of each pair of corresponding LSF 
times, e,g. (fa**./ + Au)/2,( ImM + hj)f2 and + &,/)/2, 

An even more advantageous approach takes into account that the windows 
10 typically show a fade-in/fade-out behavior as shown in Fig. 3. In this approach a weighted 
mean of each pair is calculated which gives perceptually better results. The procedure for this 
is as follows. The overlapping area corresponds to the area (rc-r a tc). Weight functions are 
derived as depicted in Fig. 6, The weight to the times of the left frame krl for each pair 
separately is calculated as: 

15 w M 



where l^an is the mean (average) of a pair, e.g.: l m tm a ( few + h*) * 2 * 
The weight for frame k is calculated as w^l-wt-i. 
The new LSF times are now calculated as: 

, ■ — . t 

where 1 M and h form a pair. Finally the weighted LSF times are uniformly quantized. 

As the first frame in a bit-stream has no history, the first frame of LSF times 
always need to be coded without exploitation of techniques as mentioned above, This may be 
done by coding the first LSF time absolutely using Huffman coding, and all subsequent 

25 " values differentially to their predecessor within a frame using a fixed Hufiman table. All 
.frames subsequent to the first frame can in essence make advantage of an above technique. 
Of course such a technique is not always advantageous. Think for instance of a situation 
where there are an equal number of LSF times in the overlap area for both frames, but with a 
very bad match. Calculating a (weighted) mean might then result in perceptual deterioration. 

30 Also the situation where in frame fc-J the number of LSF times is not equal to the number of 
LSF times in frame k is preferably not defined by an above technique. Therefore for each 
frame of LSF times an indication, such as a single bit, is included in the encoded signal to 
indicate whether or not an above technique is used, i.e. should the first number of LSF times 
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30 



be retrieved from the previous frame or are they in the bit-stream? For example, if the 
indicator bit is 1 : the weighted LSF times are coded differentially to their predecessor in 
frame JW P for frame fcthe first number of LSF times in the overlap area are derived from the 
LSFs in frame k-1. If the indicator bit is 0, the first LSF time of frame k is coded absolutely, 
all following LSFs are coded differentially to their predecessor. 

In a practical embodiment, the LSF time frames are rather long, e,g« 1440 
samples at 44.1kHz; in this case only around 30 bits per second are needed for this extra 
indication bit. Experiments showed that most of the frames could make uge of the above 
technique advantageously, resulting in net bit savings per frame. 

Further embodiment using overlapping frames 

According to a further embodiment of the invention, the LSF time data is loss- 
lessly encoded. So instead of merging the overlap-pairs to single LSF times, the differences 
of the LSF times in a given frame are encoded with respect to the LSF times in another 
frame. So in the example of Figure 3 when the values U until l N are retrieved of frame the 
first three values k until 1$ from frame k are retrieved by decoding the differences (In the bit- 
stream) to /jw, fo./, It? of frame k-1 respectively. By encoding an LSF time with reference to 
an LSF time in an other frame which is closer in time than any other LSF time in the other 
frame, a good exploitation of redundancy is obtained because times can best be encoded with 
reference to closest times. As their differences are usually rather small, they can be encoded 
quite efficiently by using a separate Huffman table. So apart from the bit denoting whether or 
not to use & technique as described in the first embodiment, for this particular example also 
the differences /o,*- fau-u h,ir fa-hh-n hk- hfiri are placed in the bit-stream, in the case the 
first embodiment is not used for the overlap concerned. 

Although less advantageously, it is alternatively possible to encode differences 
relative to other LSF times in the previous frame. For example, it is possible to only code the 
difference of the first LSF time of the subsequent frame relative to the last LSF time of the 
previous frame and then encode each subsequent LSF time in the subsequent frame relative 
to the preceding LSF time in the same frame, e.g. as follows: for frame k*h /jyij- fa- Im 
and subsequently for* frame k fa&h hfrhk etc. 



System description 

Fig. 7 shows a system according to an embodiment of the invention. The 
system comprises an apparatus 1 for transmitfciixg or recording an encoded signal [S]. The 
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apparatus 1 comprises an input unit 1 0 far receiving at least part of an audio signal S, 
preferably a noise component of the audio signal, The input unit 10 may be an antenna, 
microphone, network connection, etc. The apparatus 1 further comprises an encoder 1 1 for 
encoding the signal S according to an above described embodiment of the invention (see in 
5 particular Figs. 4, 5 and 6) in order to obtain an encoded signal. It is possible that the input 
unit 10 receives a full audio signal and provides components thereof to other dedicated 
encoders- The encoded signal is furnished to an output unit 12 which transforms the encoded 
audio signal in a bit-stream [S] having a suitable format for transmission or storage via a 
transmission medium or storage medium 2. The system further comprises a receiver or 

10 reproduction apparatus 3 which receives the encoded signal [S] in an input unit 30. The input 
unit 30 furnishes the encoded signal [S] to the decoder 31. The decoder 31 decodes the 
encoded signal by performing a decoding process which is substantially an inverse operation 
of the encoding in the encoder 1 1 wherein a decoded signal 9 * is obtained which corresponds 
to the original signal S except for those parts which were lost during the encoding process, 

15 The decoder 3 1 furnishes the decoded signal S* to an output unit 32 that provides the decoded 
signal S*. The output unit 32 may be reproduction unit such as a speaker for reproducing the 
decoded signal S% The output unit 32 may also be a ixansmitter for further transmitting the 
decoded signal S" for example over an in-home network, etc. In the case the signal & 9 is 
reconstruction of a component of the audio signal such as a noise component, then the output 

20 unit 32 may include combining means for combining the signal S* with other reconstructed 
components in order to provide a fhll audio signal. 

Embodiments of the invention may be applied in, inter alia, Internet 
distribution, Solid State Audio, 3G terminals, GPRS and commercial successors thereof. 

It should be noted that the above-mentioned embodiments illustrate rather than 

25 limit the invention, and tbat those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as limiting the claim. This 
word 'comprising 5 does not exclude the presence of other elements or steps than those listed 
in a claim. The invention can be implemented by means of hardware comprising several 

30 distinct elements, and by means of a suitably programmed computer. In a device claim 
enumerating several means, several of these means can be embodied by one and the same 
item of hardware. The mere fact that certain measures are recited in mutually different 
dependent claims does not indicate that a combination of these measures cannot be used to 
advantage. 
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CLAIMS: 



1 B A method of coding at least part of an audio signal in order to obtain an 

encoded signal, the method comprising the steps of: 

predictive coding the at least part of the audio signal in order to obtain 
prediction coefficients which represent temporal properties, such as a temporal envelope, of 

5 the at least part of the audio signal; 

traivsfoxming the prediction coefficients into a set of times representing the 

prediction coefficients; and 

including the set of times in the encoded signal. 

10 2. A method as claimed in claim 1, wherein the predictive coding is performed 

by a using a filter and wherein the prediction coefficients are filter coefficients, 

3 1 A method as claimed in claim 1 or 2, wherein the predictive coding is a linear 

predictive coding, 

15 

4, A method as claimed in any of the previous claims, wherein prior to the 
predictive coding step a time domain to frequency domain transform is performed on the at 
least part of an audio signal in order to obtain a frequency domain signal, and wherein the 
predictive coding step is performed on the frequency domain signal rather than on the at least 

20 part of an audio signal. 

5. A method as claimed in any of the previous claims, wherein the times are time 
domain derivatives or equivalents of line spectral frequencies. 

25 6. A method as claimed in any of the previous claims, wherein the at least part of 

an audio signal is segmented in at least a first frame and a second frame and wherein the first 
frame and the second frame have an overlap including at least one time of each frame. 
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7. A method as claimed in claim 6, wherein for a pair of times consisting of one 

time of the first frame in the overlap and one time of the second frame in the overlap, a 
derived time is included in the encoded signal, which derived time is a weighted average of 
the one time of the first frame and the one time of the second frame. 



8. A method as claimed in claim 7, wherein the derived time is equal to a 

selected one of the times of the pair of times. 



9. A method as claimed in claim 7, wherein a time closer to a boundary of a 
10 frame has lower weight than a time Anther away from said boundary, 

10. A method as claimed in claim 6, wherein a given time of the second frame is 
differentially encoded with respect to a time In the first frame. 



15 11. A method as claimed in claim 10, wherein the given time of the second frame 

is differentially encoded with respect to a time in the first frame which is closer in time to ttxe 
given time in the second frame than any other time in the first frame, 

12. A method as claimed in any of the claims 7, 8, 9, 10 or 1 1 , wherein further an 
20 indicator, such as a single bit, is included in the encoded signal, which indicator indicates 

whether or not the encoded signal includes a derived time in the overlap to which the 
indicator relates. 

13. A method as claimed in any of the claims 7, 8, 9, 10, 11 or 12, wherein further 
25 an indicator, such as a single bit, is included in the encoded signal, which indicator indicates 

the type of coding which is used to encode the times or derived times in the overlap to which 
the indicator relates. 



14. An encoder for coding at least part of an audio signal in order to obtain an 

30 encoded signal, the enco der comprising: 

means for predictive coding the at least part of the audio signal in order to 
obtain prediction coefficients which represent temporal properties, such as a temporal 
envelope, of the at least part of the audio signal; 
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means for transforming the prediction coefficients into a set of times 
representing the prediction coefficients; and 

means for including flie set of times in the encoded signal, 

5 15. An enooded signal representing at least part of an audio signal, the encoded 

signal including a set of times representing prediction coefficients which ptediction 
coefficients represent temporal properties, such as a temporal envelope, of the at least part of 
the audio signal. 

10 16. An encoded signal as claimed in claim 15„ wherein the times are related to at 

least a first frame and a second frame in the at least part of an audio signal and wherein the 
first frame and the second frame have an overlap including at least one time of each frame, 
and wherein the encoded signal includes at least one derived time, which derived time is a 
weighted average of the one time of the first frame and the one time of the second frame. 

IS 

17. An encoded signal as claimed in claim 1 6, the encoded signal further 

comprising an indicator, such as a single bit, which indicator indicates whether or not the 
encoded signal includes a derived time in the overlap to which the indicator relates. 

20 1 8. A storage medium having stored thereon an encoded signal as claimed in any 

of the claims 15, 16, or 17. 

19, A method of decoding an encoded signal representing at least part of an audio 
signal, the encoded signal including a set of times representing prediction coefficients which 

25 prediction coefficients represent temporal properties, such as a temporal envelope, of the at 
least part of the audio signal, the method comprising the steps of; 

deriving the temporal properties, such as the temporal envelope, from the set 
of times and using these temporal properties in order to obtain a decoded signal, and 

providing the decoded signaL 

30 

20, A method of decoding as claimed in claim 1 9, wherein the method comprises 
the step of transforming the set of times in order to obtain tie prediction coefficients, and 
wherein the temporal properties are derived from the prediction coefficients rather than from 
the set of times. 
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21 > A method of decoding as claimed in claim 19 or 20, wherein the times are 

related to at least a first frame and a second frame in the at least part of an audio signal and 
wherein the first frame and the second frame have an overlap including at least one time of 
5 each frame, and wherein the encoded signal includes at least one derived time, which derived 
time is a weighted average of a pair of times consisting of one time of the first frame in the 
overlap and one time of the second frame in the overlap in the original at least part of an 
audio signal, wherein tbe method comprises farther the step of using the at least one derived 
time in decoding the first frame as well as in decoding the second frame. 

10 

22. A method of decoding as claimed in claim 2 1 , wherein the encoded signal 

further comprising an indicator, such as a single bit, which indicator indicates whether or not 
the encoded signal includes a derived time in the overlap to which the indicator relates, the 
method further comprising the steps of: 
1 5 obtaining the indicator from the encoded signal, 

only in the case that the indicator indicates that the overlap to which the 
indicator relates does include a derived time, performing the step of using the at least one 
derived time in decoding the first frame as well as in decoding the second frame, 

20 23 . A deooder for decoding an encoded signal representing at least part of an 

audio signal, the encoded signal including a set of times presenting prediction coefficients 
which prediction coefficients represent temporal properties, such a$ a temporal envelope, of 
the at least part of the audio signal, the method comprising the steps of: 

deriving the temporal properties, such as the temporal envelope, from the set 
25 of time and using these temporal properties in order to obtain a decoded signal, and 

providing the decoded signal. 

24. A transmitter comprising: 

an input unit for receiving at least part of an audio signal, 
30 an encoder as claimed in claim 14 for encoding the at least part of an audio 

signal to obtain an encoded signal, and 

an output unit for transmitting the encoded signal, 



25 



A receiver comprising: 
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an input trait for receiving an encoded signal representing at least part of an 

audio signal, 

a decoder as claimed in claim 23 for decoding the encoded signal to obtain a 
decoded signal, and 

an output unit for providing the decoded signaL 

26. A system comprising a transmitter as claimed in claim 24 and a receiver as 

claimed in claim 25. 
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ABSTRACT: 



According to a first aspect of the invention, at least part of an audio signal is 
coded in order to obtain an encoded signal, the coding comprising predictive coding the at 
least part of the audio signal in order to obtain prediction coefficients which represent 
temporal properties,, such as a temporal envelope, of the at least part of the audio signal, 
5 transforming the prediction coefficients into a set of times representing the prediction 

coefficients, and including the set of times in the encoded signal. Especially the use of a time 
domain derivative or equivalent of the Line Spectral Representation is advantageous in 
coding such prediction coefficients, because with this technique times or time instants are 
well defined which makes them more suitable for further encoding. For overlapping frame 
1 0 analysis/synthesis for the temporal envelope, redundancy in the Line Spectral Representation 
at the overlap can be exploited- Embodiments of the invention exploit this redundancy in an 
advantageous manner- 
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